Temperature in complex networks 
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Various statistical-mechanics approaches to complex networks have been proposed to describe 
expected topological properties in terms of ensemble averages. Here we extend this formalism by 
introducing the fundamental concept of graph temperature, controlling the degree of topological 
optimization of a network. We recover the temperature-dependent version of various important 
models as particular cases of our approach, and show examples where, remarkably, the onset of a 
percolation transition, a scale-free degree distribution, correlations and clustering can be understood 
as natural properties of an optimized (low-temperature) topology. We then apply our formalism to 
real weighted networks and we compute their temperature, finding that various techniques used to 
extract information from complex networks are again particular cases of our approach. 

PACS numbers: 89.75.Hc, 89.75.Fb, 05.70.-a 
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Complex networks have recently attracted much inter- 
est from the community of physicists, due to the possibil- 
ity of applying various techniques generally inspired by 
statistical mechanics in order to highlight universal prop- 
erties P, 0. Among the several approaches that have 
been explored, an interesting one (originally proposed by 
sociologists and termed exponential random graphs 
has been recently explored within an explicit statistical- 
mechanics framework 0, 0, IE IE 01- This powerful 
formalism allows one to treat in a unified fashion a large 
class of models, including random graphs 0,l2ithe con- 
figuration model ^E| J hidden- variable models 0, and 
generalizations of them 0, Q • If we restrict ourselves to 
unweighted networks with a fixed number of vertices TV 
and with no self-loops or multiple edges, each link is re- 
garded as a "particle" that can be placed between any 
two vertices, subject to the constraint that the "occu- 
pation number" of each pair of vertices i,j can only be 
Oij = 0, 1 as in the familiar Fermi statistics. Clearly, aij 
coincides with the entries of the N x N adjacency matrix 
A characterizing the topology completely. Each allowed 
adjacency matrix A corresponds to a possible configura- 
tion, and the set of possible configurations (each with its 
statistical weight Pa) defines the statistical ensemble of 
graphs. 

The above framework allows one to develop the statis- 
tical mechanics of networks by exploiting a range of tools 
which are well known in physics 6]. However, this ther- 
modynamical analogy has not been fully explored yet. 
In particular, the concept of temperature, which is of key 
importance in the study of statistical ensembles, has re- 
ceived little or no attention until now in the literature 
on complex networks. Our aim in the present paper is 
to fully develop the thermodynamical formalism in order 
to include the temperature of a graph explicitly. As we 
report below for several particular cases including the- 
oretical models and empirical data, we find intriguing 



results showing that the temperature allows for an addi- 
tional understanding of complex networks. In particular 
we find that many well-known topological properties such 
as the presence of a giant component, a scale-free degree 
distribution, correlations and clustering can be easily un- 
derstood in terms of the low-temperature behaviour of 
real networks. Our results can also be exploited to com- 
pute the temperature of real networks directly. 

The reason for introducing the temperature of a net- 
work is because in our view the statistical formalism is 
incomplete without it. In all the approaches to expo- 
nential random graphs, the probability Pa depends on 
the energy Ea of the graph A, representing the cost of 
realising A. Now, this concept of cost is unclear with- 
out the assessment of its relative role with respect to 
the available resources that can be exploited to form the 
network. The relative importance of cost and available 
resources is usually controlled in statistical physics by the 
temperature. In the zero-temperature regime the system 
is forced to severe optimization, so that only the least 
costly configuration can be formed and the units of the 
system occupy the states with lowest energy (this is the 
optimized case). In the opposite, infinite-temperature ex- 
treme the system does not distinguish between cheap and 
expensive states, so that all configurations occur with the 
same probability. The formalism that we develop here is 
particularly suitable to model networks subject to such 
economic/engineering constraints. We shall complement 
the standard results obtained in the literature for the 
generic finite-temperature case (which is recovered when 
T = 1) with the interesting ones corresponding to zero 
and infinite temperature, which are not accessible to cur- 
rent finite-temperature models. A range of interesting re- 
sults can be obtained by even the simplest models when 
T is allowed to vary, in particular when T = 0. Our 
approach works equally well for directed as well as undi- 
rected graphs, but for the sake of simplicity we write all 



2 



the expressions for the undirected case only. The gener- 
ahzation to directed graphs is straightforward. 

The most general statistical ensemble for an equilib- 
rium undirected network is a grandcanonical one |9| with 
2^(^-1)72 gj-apj^s having a fixed number of vertices N 
and a varying number of links La — ^-U ' controlled 
by the chemical potential /i. If the temperature is explic- 
itly introduced, the probability of graph A is 



= ^ exp 



Ea 



T 



(1) 



where Z = exp(/iLyi — Ea)/T is the grand partition 
function of the ensemble. Note that when T — > oo we 
have Pa = 2-^(^-i)/2 for all graphs, while when T 
we have Pa — 1 for the graph with the maximum value 
of fiLA — Ea (or Pa — M^^ if there are M degenerate 
such graphs), and Pa = for all other graphs. The 
energy Ea can be in general an arbitrarily complicated 
function of the adjacency matrix A, but throughout the 
present paper we consider the instructive case, explored 
in most models, where it can be written as a sum over 
the individual link energies eij 0,0: Ea = '^ijO'ij^ij- 
This allows us to write 



{A} i] 



(2) 



and Pa = Hij pTj' ~ PijY > where 
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(3) 



is the probability that a link between i and j exists, which 
has the usual form of Fermi statistics (alternative deriva- 
tions of the above form for pij are given in refs.0,0] for 
T = 1). Therefore the additivity of Ea implies that each 
link is drawn with probability pij independently of each 
other. If the form of is further simplified, many im- 
portant network models are obtained as particular cases 
of eq.©, including hidden- variable models, the config- 
uration model and random graphs We shall intro- 
duce the temperature-dependent version of these models 
in what follows. We shall also exploit eq.Q to study 
real networks and compute their temperature empirically. 
Therefore eq.© gives rise to a rich phenomenology and 
will be of central importance throughout the paper. Be- 
fore considering its particular cases, let us first note some 
of its general properties. Note that, independently of T, 
Pij > 1/2 when < fj. and pij < 1/2 when e^j > fi. It is 
interesting to consider the infinite- and zero-temperature 
limits. When T — > -|-oo we have from eq.(|3J) that 
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(4) 



irrespective of the values of and hence of the differ- 
ences in the cost of links. As a consequence, the network 
is a random graph with p = 1/2 and is therefore trivial. 



Note that in this case any two configurations A and B 
become equiprobable {Pa — Pb)- When T = we have 



(5) 



so that only those pairs of vertices with e^j < /i are con- 
nected. This is the well-known degenerate behaviour of 
Fermions at zero temperature, and is also termed the 
Fermi energy ep = fJ-- This clarifies the role of fi as the 
available energy per link when T 0: at absolute zero 
only the topology with the minimum value of Ea — I^-La 
can be realized, and this topology is obtained by draw- 
ing all and only the links with e^j < /i. A final general 
comment is that cq.Q reduces to the "classical" limit 

Py(T) «e(''-^-^)/^ when e^^'^"'^)/^ > 1 (6) 

We now consider various specific cases. The simplest 
scenario is when all link energies are equal: e.y — e. This 
yields a temperature-dependent random graph since all 
probabilities pij are equal to p{T) = [e^^^^)/"^ + 1]^^- 
While the properties of the random graph are well- 
known, in our framework some intriguing results emerge 
as T ^ 0. First of all note that p{0) = 6(/x — e) and 
the graph is either fully connected (/i > e) or empty 
{fi < e). This provides us with a temperature-based def- 
inition of sparseness of a random graph: we can define a 
random graph as sparse {dense) if e > ^ (e < ^) since 
when T — the graph becomes empty (fully connected). 
Then, it is clear that the critical probability pc ~ 1/A^ 
marking the percolation transition with the onset of a gi- 
ant connected component can only be displayed by sparse 
graphs, and if we fix e and fi with e < /i we can regard 
the phase transition as temperature-dependent. In par- 
ticular, there is a critical percolation temperature such 
that p{Tc) — Pc ^ 1/N. Inverting p{T) wc find 
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Therefore when N ^ oo we have the remarkable result 
that the critical percolation temperature tends to zero, 
meaning that the zero-temperature topology naturally sets 
at the critical point p = pc and that at finite tempera- 
ture the network is always above the percolation threshold. 
This result is general: when the etj 's are different we have 

Tmzn <Tc < Tmax whcrC Tmm = {<^rmn - M)/ln7V ^ 

and Tmax — {^max — /i)/ In iV — » as — > OO. Therefore 
Tc —> in this case too, suggesting why large real world 
networks display a giant component. Interestingly, this 
behaviour is similar to a scenario explored in the theory 
of self-organized criticality (SOC) where the relation be- 
tween a vanishing critical temperature and the onset of 
the SOC behaviour has been explored (ll| . 

Another case of interest, showing the surprising effects 
of T, is when each link energy is the sum of two single- 
vertex contributions: = -I- ej. This is the grand- 
canonical version of the configuration model |^ since all 
graphs with the same degree sequence have the same 
Ea and are therefore equiprobable. The novelty of our 
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approach is that the vertex fugacities Xi = e"*^'/^ and 
z = e^l'^ (in terms of which the model is conveniently 
described now depend on T. We have 
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(8) 



The standard procedure to obtain scale-free degree dis- 
tributions in this model is to assign each vertex i a fugac- 
ity Xj drawn from a power-law distribution p[x) ~ x~^ 
[l^. It has been shown that, since pij saturates to 1 as 
Xj +0O, a cut-off appears in the degree distribution 
ensuring that the degrees do not exceed N. This 
is for instance the case of the Internet l^l and of the 
World Trade Web (WTW) [H]. To highlight the role of 
T, it is interesting to rephrase these results in terms of 
the energies e^. For convenience we introduce the ver- 
tex fitness (pi = —ti and we take it to be non- negative 
(this can always be achieved by a shift in the energies 

^ - imax < 0). Therefore (/f^ > (or > 1) 
measures the tendency of vertex i to form connections 
0. Similarly, we define (f>o = —p- Now, if we want 
X to be distributed according to p(x) = (7 — l)a;~^ 
(where 1 < x < +00 and 7 > 1), then the fitness 

= —ei — Tlnxi must be distributed according to 
g(,/,) = e-'^('^"i)/'^(7 - 1)/T. But since does not de- 
pend on T, qicf)) must be T-independent as well. The 
only possibility is therefore (7 — 1)/T = A where A is a 
constant independent of T. Without loss of generality we 
can reabsorbe A in a rescaling of the energy e ^ Ae so 
that we can set A = 1 . This yields 7 = 1 + T and 
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> 0), p{x) ^ Tx 



-l-T 



{x>l) (9) 



which is an important result showing how T deter- 
mines p{x) and consequently the topology of the net- 
work. For instance, in the classical limit (|SJ) we re- 
cover the T-dependent version of a model studied in 
ref.0: since pij « zxiXj, the expected degree ki = 
Vij ~ X] j proportional to Xi and is therefore 
distributed as P(fc) oc k~^~'^ , but there are no degree 
correlations due to the factorization of pij . In the more 
general case, P(k) has a power-law region with an expo- 
nent that is still an increasing function of T, followed by a 
cut-off due to the saturation of pij . The power-law region 
narrows as T increases. This qualitative behaviour can be 
characterized rigorously by computing ki as a function of 
Xi or (j)i, and inverting this relation to find P{k) from p{x) 
or q{(f>). This is not easy in general, but here we show 
that in the three paradigmatic cases T — +00, T — 1 
and T = it can be done successfully. For T = +00 we 
have the usual result pij = 1/2, and P(k) approaches a 
trivial Poisson distribution with mean N/2. For T = 1, 



denoting pij = p{ 



we have 



^ = %(0>').(0')rf0' = A^/ ^ J-t-fl^ 



N'^^''7\+'KNzxlnl±^ 



which is an increasing function of x and is therefore in- 
vertible, even if in a non-algebraic way. If x{k) denotes 
the_ inverse function^ the expected degree distribution is 
P{k) — p[x{k)]dx/dk. Note that k oc x for small x, while 
k ^ N for large x. Thus in the linear regime x (x k while 
dx/dk is constant, implying P{k) oc p[x(fc)] oc k"^. This 
region is followed by a cut-off for large k corresponding 
to the saturated behaviour. Finally, when T = the ex- 
pression for p{x) in eq.© breaks down since all the x^'s 
become infinite, and from eq.© we find 



p,y(O) = e(0, +(/)j-?!)0) 



(11) 



zx 



(10) 



Surprisingly, this coincides with another model intro- 
duced in ref.0, which precisely assumes q{(j)) = e~'^ and 
thus turns out to be the zero-temperature limit of our 
general model. This model is intriguing since, using a 
derivation similar to that in eq. IjlOl) , it is shown 0i 1^ to 
yield a purely scale-free degree distribution P{k) oc 
with no cut-off even if no power-laws are introduced "by 
hand" in it. Moreover, the model displays anticorre- 
lation between degrees: the average nearest neighbour 
degree scales as A:""(fc) oc k~^ and the clustering coeffi- 
cient scales as c(fc) oc k~'^ (times logarithmic corrections) 
0, 13 ■ In our framework it is clear that 0o plays the role 
of a Fermi energy. We can also interpret the correla- 
tions at T = as the collective need to minimise the 
total energy, an effect that gradually weakens as T in- 
creases. Taken together, these intriguing results show 
that in the above model correlated scale-free networks 
with exponent —2 naturally arise as the zero-temperature 
optimized topology. As T grows, the correlations become 
weaker, the exponent of P{k) increases and a cut-off ap- 
pears in it destroying its purely scale-free behaviour, un- 
til for T — > 00 the network becomes uncorrelated with a 
Poisson degree distribution. If one has access to the em- 
pirical distribution p{x), one can measure T for any real 
network which is well described by eq. © . This is possi- 
ble for the WTW, where Xi has been identified with the 
Gross Domestic Product of country z, whose distribution 
has a power-law tail with exponent -2 |l3j |. This means 
that Ty/TW = 1 and that ea. (|10|l applies. This is consis- 
tent with the observed saturated behaviour of k(x) and 
the cut-off displayed by P{k) for the real WTW 

It is possible to further explore eq.© by considering 
different forms of g(0) and of e^- as a function of (pi and 
(j)j, thus recovering the whole class of hidden- variable 
models with generic pij = p{(t)i ,(t>j)- An even more 
general case is when cannot be written as a func- 
tion of single-vertex contributions, so that each pair of 
vertices has an associated link fitness (pij = —tij drawn 
from a distribution (?(0), and a probability pij = p{4>ij) 
to exist. The range of possibilities is very broad, and 
among them we consider a particular case which allows 
for a direct application to real weighted networks and for 
the characterization of their temperature. Very recently 
[1^ we explored the idea that the empirical weights Wij 
in a real weighted network can be mapped into a ma- 
trix of probabilities pij = p{wij) defining an ensemble 
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Network 


7 


r 


Ref. 


Metabolic flux networks 


1.5 


0.5 


[15] 


Interbank contract sizes 


1.87 


0.87 


[16] 


Erdos collaboration network 


2 


1 


[17] 


Chaos control & synchron. coauthorship 


2.5 


1.5 


[17] 


Financial cross-correlations 


2.7 


1.7 




Financial cross-correlations 


2.78 


1.78 


\m 


Financial cross-correlations 


3.18 


2.18 


[19] 


MoUusk research coauthorship 


3.5 


2.5 


[17] 


Unweighted graphs 


+ 00 


+CXJ 





TABLE I: Empirical values of 7 and T for weigthed networks. 



of unweighted graphs, or ensemble network. In such a 
way, many topological properties which are non-obvious 
for weighted networks (such as the clustering coefficient) 
can be re-defined as ensemble avera ges of the correspond- 
ing unweighted quantities. In ref. jl4j we explored the 
simplest possible choice where pij cx Wij. The results 
presented here suggest that this choice is the "classi- 
cal" limit, equivalent to eq.©, of a more general choice 
that we now consider. Turning to cq.(|3Jl, if we require 
Pij = when Wij — and pij = 1 when Wij — +00, 
we find that Wij must be proportional to the link fugac- 
ity e"*^'^/^. In other words, the weights must depend 
on T, which corresponds to the property that at low T 
the least expensive links are the most exploited, while at 
high T all the weights are equal. Now, many real net- 
works 

'T3,'Tl!i3,llllIi display a power-law distribution 
of link weights p(w) oc w~'^ with 1.5 < 7 < 3.5 (see Ta- 
bleQJ. Therefore if we define Xij = Wij/wmin > 1 (where 
Wjnin is the minimum nonzero weight for a given net- 
work), corresponding to the rescaling e,y — + e,y — emaxi 
we can repeat the arguments leading to eq.@ and set 
eij = —Tin Xij < and (pij = —^ij > to obtain the 
same forms of p{x) and q{(i>)- This allows us to com- 
pute the temperature of real networks with power-law 
distributed weights as T = 7-I- 1. The empirical values of 



7 found in various weighted networks El El m El m 

are summarized in Tabled and the corresponding values 
of T (ranging from 0.5 to 2.5) are also shown. By con- 
trast, unweighted networks correspond to T — > 00 where 
Xij — 1 Vi,j. We have therefore found that a general 
mapping from weights to probabilities in the context of 
ensemble networks is given by = zxij/ {\-\-zxij) where 
Xij = Wij/wmin and z = e'^/'^ is a free parameter. We 
note that the classical limit 10 of this expression reads 
Pij = zxij^ and if we choose z — Wmin/wmax we have 
Pij — Wij /wmax, which is approximately equivalent to 
the choice explored by us in ref.E3|- Turning to the full 
expression for pij, when T eq.^ implies that the 
original weighted network is mapped into a deterministic 
unweighted one where only the links with eij < fi are 
drawn. This means that the links with weight such that 
Xij{T) > z{T) in the limit T — > are selected and the 
others are discarded. Interestingly, since the ordering of 
the weights is preserved at all temperatures, this corre- 
sponds to the threshold procedure adopted in ref.jl^ to 
filter stock correlations and in ref. '2 1*1 to extract mini- 
mum spanning trees from real foodwebs. These filtering 
techniques discard most of the information contained in 
the weights, resulting in a single (threshold-dependent) 
unweighted graph. Here we find that this corresponds to 
the zero-temperature limit for an ensemble network. Our 
results allow us to apply these techniques to the finite 
temperature case in order to preserve the heterogeneity 
of the links and exploit it to obtain the whole ensemble 
of possible configurations. 

We have presented a novel approach to complex net- 
works by introducing the concept of temperature. We 
have shown many remarkable temperature-dependent ef- 
fects resulting from topological optimization, and recov- 
ered several seemingly distinct models and techniques as 
particular cases of our unifying approach. This formal- 
ism gives an improved understanding of network topology 
and is open to further investigations. 
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